ai model
Seeing Sound Hearing Sight Uncovering Modality Bias and Conflict of AI models in Sound Localization
Imagine hearing a dog bark and instinctively turning toward the sound--only to find a parked car, while a silent dog sits nearby. Such moments of sensory conflict challenge perception, yet humans flexibly resolve these discrepancies, prioritizing auditory cues over misleading visuals to accurately localize sounds. Despite the rapid advancement of multimodal AI models that integrate vision and sound, little is known about how these systems handle cross-modal conflicts or whether they favor one modality over another. Here, we systematically and quantitatively examine modality bias and conflict resolution in AI models for Sound Source Localization (SSL). We evaluate a wide range of state-of-the-art multimodal models and compare them against human performance in psychophysics experiments spanning six audiovisual conditions, including congruent, conflicting, and absent visual and audio cues.
Unveiling the Uncertainty in Embodied and Operational Carbon of Large AIModels through a Probabilistic Carbon Accounting Model
The rapid growth of large AI models has raised significant environmental concerns due to their substantial carbon footprint. Existing carbon accounting methods for AI models are fundamentally deterministic and fail to account for inherent uncertainties in embodied and operational carbon emissions. Our work aims to investigate the effect of these uncertainties on embodied and operational carbon footprint estimates for large AI models. We propose a Probabilistic Carbon Accounting Model (PCAM), which quantifies uncertainties in the carbon accounting of large AI models. We develop parameter models to quantify key components (processors, memory, storage) in the carbon footprint of AI models. To characterize the distribution of the parameters, we develop a carbon dataset by aggregating related data from various sources. Then, we generate the probabilistic distribution of the parameters from the collected dataset. We compare the performance of PCAM with LLMCarbon, the state-of-the-art carbon accounting method for large AI models.
The White House Is Making Up Its Rules for AI in Real Time
Anthropic still can't distribute Claude Mythos or Fable 5 after running afoul of the Trump administration. But no one can say exactly what the company did wrong. It's been nearly a week since the Trump administration sent an export control directive to Anthropic, forcing one of the world's leading AI labs to pull its most advanced models offline. After days of negotiations between Anthropic and the White House, the two still remain at odds about how to bring Claude Mythos and Fable 5 back. Well, it depends whom you ask.
The Download: a new hunt for dark matter and Kenya's case for going solar
Plus: The Pentagon says it used Grok in strikes on Iran. For decades, physicists have hunted for weakly interacting massive particles (WIMPs), a leading candidate for dark matter. But their search has run into a new problem: neutrinos. These tiny particles from the sun and other stars can create a "neutrino fog" that drowns out any signal of dark matter. Hitting the neutrino fog does not, however, mean an end to the search. Researchers just have to shift the focus of their hunt.
ChatGPT can be made to generate sexualised and violent images, researchers find
The latest public version of ChatGPT can be made to generate sexualised images or depict scenes of graphic violence with a simple prompt, researchers have told the BBC. British AI security startup Mindgard figured out how to make ChatGPT create graphic pictures by slightly altering a widely-shared instruction, or prompt, which was originally designed to produce humorous results. After being contacted by the BBC, ChatGPT's maker OpenAI said it had taken action to stop the chatbot responding with those types of images. After investigating this trend, we've introduced additional safeguards against this type of prompt, it said in a statement. It also said it has multiple layers of protection to prevent users making content which breaches its terms and conditions.
The White House Wants Anthropic to Block All Jailbreaks. That May Not Be Possible
Trump administration officials tell WIRED that if Anthropic wants to rerelease Fable 5, it will need to ensure the model's guardrails can't be circumvented. Security experts say that can't be done. The Trump administration's disagreement with Anthropic over its most advanced AI models appears to be fast coming to a head. Trump officials tell Inner Loop that if Anthropic wants to rerelease Claude Fable 5, the AI model that they took offline with export controls last week over concerns about jailbreaking--a method of using prompts to get around a model's safeguards--the company will need to take steps to actually address what the government alleges are vulnerabilities. Anthropic has said for days that the administration's concerns are overblown and that the effects of the jailbreaks are minimal.
'Dangerous' AI Models Are Coming No Matter What
'Dangerous' AI Models Are Coming No Matter What The US government crackdown on Anthropic's Claude Fable 5 and Mythos 5 hides a glaring truth: AI models with advanced hacking capabilities will soon be the norm. Late last week, Anthropic took its new Claude Fable 5 and Mythos 5 AI models offline following a United States government export-control directive barring "any foreign national" from using the services. The company has been in talks with the White House since Friday but has yet to secure an agreement that would allow it to reinstate the offerings. Since Mythos debuted in April, Anthropic has claimed--and warned--that the model has advanced capabilities for not only finding software vulnerabilities to help defenders patch them, but also figuring out ways to exploit them that could be used by bad actors. Anthropic itself noted this double edged sword in its launch of Mythos 5 and Claude Fable 5. "A great deal of advanced usage of AI models is dual use: the same queries that are beneficial in the hands of cybersecurity professionals and biology researchers could be dangerous if available to malicious actors," the company wrote in a blog post last week.
Assume You Will Be Hacked
AI is enabling a deluge of cyberattacks the likes of which we've never seen before. Late last month, I began to consider withdrawing some money from my savings account to buy gold. It's the first time I've ever thought about panic-buying. For all of the firewalls and two-factor-authentication codes, the safety of the internet is starting to falter. Hackers are gaining the upper hand over organizations around the world--hospitals, energy grids, government agencies, and, yes, banks.
Elon Musk's unprecendented accumulation of wealth
IPO mints Musk as world's first trillionaire - now SpaceX is public, it will be harder than ever not to have a stake in its future I'm filling in for your usual host Blake Montgomery, who is out this week on vacation. Today, we'll be talking about the historic SpaceX IPO and the US government's surprise order to limit the use of Anthropic's most advanced AI model over cybersecurity concerns. Elon Musk's SpaceX hit the market on Friday in the biggest IPO of all time, raising $85.7bn and easily shattering the previous record of $29.4bn set by the Saudi oil giant Aramco. The rocket, AI and satellite communications company ended the day at $160.95 per share, up from its IPO price of $135 and satisfying any Wall Street skepticism over the unorthodox rollout of the stock. SpaceX's successful market debut turned Musk into the world's first trillionaire, an unprecedented accumulation of wealth that supporters touted as a testament to his financial genius and critics denounced as a symbol of a broken economic system.